<110> APPLICANT: Coit, Doris
      Medina-Selby, Angelica
      Selby, Mark
      Houghton, Michael
<120> TITLE OF INVENTION: NOVEL HCV NON-STRUCTURAL POLYPEPTIDE
<130> FILE REFERENCE: PP01617.002
<140> CURRENT APPLICATION NUMBER: US 09/721,479B
<141> CURRENT FILING DATE: 2000-11-22
<160> NUMBER OF SEQ ID NOS: 19
<170> SOFTWARE: PatentIn Ver. 2.0

<210> SEQ ID NO: 1
<211> LENGTH: 9620
<212> TYPE: DNA
<213> ORGANISM: Artificial Sequence
<220> FEATURE:
<221> NAME/KEY: CDS
<222> LOCATION: (1990)..(7302)
<220> FEATURE:
<223> OTHER INFORMATION: Description of Artificial Sequence: Hepatitis C pns345
<400> SEQUENCE: 1

cgcgcgtttc ggtgatgacg gtgaaaacct ctgacacatg cagctcccgg agacggtcac 60
agcttgtctg taagcggatg ccgggagcag acaagcccgt cagggcgcgt cagcgggtgt 120
tggcgggtgt cggggctggc ttaactatgc ggcatcagag cagattgtac tgagagtgca 180
ccatatgaag ctttttgcaa aagcctaggc ctccaaaaaa gcctcctcac tacttctgga 240
atagctcaga ggccgaggcg gcctcggcct ctgcataaat aaaaaaaatt agtcagccat 300
ggggcggaga atgggcggaa ctgggcgggg agggaattat tggctattgg ccattgcata 360
cgttgtatct atatcataat atgtacattt atattggctc atgtccaata tgaccgccat 420
gttgacattg attattgact agttattaat agtaatcaat tacggggtca ttagttcata 480
gcccatatat ggagttccgc gttacataac ttacggtaaa tggcccgcct ggctgaccgc 540
ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt tcccatagta acgccaatag 600
ggactttcca ttgacgtcaa tgggtggagt atttacggta aactgcccac ttggcagtac 660
atcaagtgta tcatatgcca agtccgcccc ctattgacgt caatgacggt aaatggcccg 720
cctggcatta tgcccagtac atgaccttac gggactttcc tacttggcag tacatctacg 780
tattagtcat cgctattacc atggtgatgc ggttttggca gtacaccaat gggcgtggat 840
agcggtttga ctcacgggga tttccaagtc tccaccccat tgacgtcaat gggagtttgt 900
tttggcacca aaatcaacgg gactttccaa aatgtcgtaa taaccccgcc ccgttgacgc 960
aaatgggcgg taggcgtgta cggtgggagg tctatataag cagagctcgt ttagtgaacc 1020
gtcagatcgc ctggagacgc catccacgct gttttgacct ccatagaaga caccgggacc 1080
gatccagcct ccgcggccgg gaacggtgca ttggaacgcg gattccccgt gccaagagtg 1140
acgtaagtac cgcctataga ctctataggc acaccccttt ggctcttatg catgctatac 1200
tgtttttggc ttggggccta tacacccccg ctccttatgc tataggtgat ggtatagctt 1260
agcctatagg tgtgggttat tgaccattat tgaccactcc cctattggtg acgatacttt 1320
ccattactaa tccataacat ggctctttgc cacaactatc tctattggct atatgccaat 1380
actctgtcct tcagagactg acacggactc tgtattttta caggatgggg tccatttatt 1440
atttacaaat tcacatatac aacaacgccg tcccccgtgc ccgcagtttt tattaaacat 1500
agcgtgggat ctccgacatc tcgggtacgt gttccggaca tgggctcttc tccggtagcg 1560
gcggagcttc cacatccgag ccctggtccc atccgtccag cggctcatgg tcgctcggca 1620
gctccttgct cctaacagtg gaggccagac ttaggcacag cacaatgccc accaccacca 1680
gtgtgccgca caaggccgtg gcggtagggt atgtgtctga aaatgagctc ggagattggg 1740
ctcgcacctg gacgcagatg gaagacttaa ggcagcggca gaagaagatg caggcagctg 1800
agttgttgta ttctgataag agtcagaggt aactcccgtt gcggtgctgt taacggtgga 1860
gggcagtgta gtctgagcag tactcgttgc tgccgcgcgc gccaccagac ataatagctg 1920
acagactaac agactgttcc tttccatggg tcttttctgc agtcaccgtc gtcgacctaa 1980

gaattcacc atg gct gca tat gca gct cag ggc tat aag gtg cta gta ctc 2031
Met Ala Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu
1 5 10

aac ccc tct gtt gct gca aca ctg ggc ttt ggt gct tac atg tcc aag 2079
Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys
15 20 25 30

gct cat ggg atc gat cct aac atc agg acc ggg gtg aga aca att acc 2127
Ala His Gly Ile Asp Pro Asn Ile Arg Thr Gly Val Arg Thr Ile Thr
35 40 45

act ggc agc ccc atc acg tac tcc acc tac ggc aag ttc ctt gcc gac 2175
Thr Gly Ser Pro Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp
50 55 60

ggc ggg tgc tcg ggg ggc gct tat gac ata ata att tgt gac gag tgc 2223
Gly Gly Cys Ser Gly Gly Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys
65 70 75

cac tcc acg gat gcc aca tcc atc ttg ggc att ggc act gtc ctt gac 2271
His Ser Thr Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr Val Leu Asp
80 85 90

caa gca gag act gcg ggg gcg aga ctg gtt gtg ctc gcc acc gcc acc 2319
Gln Ala Glu Thr Ala Gly Ala Arg Leu Val Val Leu Ala Thr Ala Thr
95 100 105 110

cct ccg ggc tcc gtc act gtg ccc cat ccc aac atc gag gag gtt gct 2367
Pro Pro Gly Ser Val Thr Val Pro His Pro Asn Ile Glu Glu Val Ala
115 120 125

ctg tcc acc acc gga gag atc cct ttt tac ggc aag gct atc ccc ctc 2415
Leu Ser Thr Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu
130 135 140

gaa gta atc aag ggg ggg aga cat ctc atc ttc tgt cat tca aag aag 2463
Glu Val Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys
145 150 155

aag tgc gac gaa ctc gcc gca aag ctg gtc gca ttg ggc atc aat gcc 2511
Lys Cys Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly Ile Asn Ala
160 165 170

gtg gcc tac tac cgc ggt ctt gac gtg tcc gtc atc ccg acc agc ggc 2559
Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly
175 180 185 190

gat gtt gtc gtc gtg gca acc gat gcc ctc atg acc ggc tat acc ggc 2607
Asp Val Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly
195 200 205

gac ttc gac tcg gtg ata gac tgc aat acg tgt gtc acc cag aca gtc 2655
Asp Phe Asp Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val
210 215 220

gat ttc agc ctt gac cct acc ttc acc att gag aca atc acg ctc ccc 2703
Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Ile Thr Leu Pro
225 230 235

caa gat gct gtc tcc cgc act caa cgt cgg ggc agg act ggc agg ggg 2751
Gln Asp Ala Val Ser Arg Thr Gln Arg Arg Gly Arg Thr Gly Arg Gly
240 245 250

aag cca ggc atc tac aga ttt gtg gca ccg ggg gag cgc ccc tcc ggc 2799
Lys Pro Gly Ile Tyr Arg Phe Val Ala Pro Gly Glu Arg Pro Ser Gly
255 260 265 270

atg ttc gac tcg tcc gtc ctc tgt gag tgc tat gac gca ggc tgt gct 2847
Met Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala
275 280 285

tgg tat gag ctc acg ccc gcc gag act aca gtt agg cta cga gcg tac 2895
Trp Tyr Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr
290 295 300

atg aac acc ccg ggg ctt ccc gtg tgc cag gac cat ctt gaa ttt tgg 2943
Met Asn Thr Pro Gly Leu Pro Val Cys Gln Asp His Leu Glu Phe Trp
305 310 315

gag ggc gtc ttt aca ggc ctc act cat ata gat gcc cac ttt cta tcc 2991
Glu Gly Val Phe Thr Gly Leu Thr His Ile Asp Ala His Phe Leu Ser
320 325 330

cag aca aag cag agt ggg gag aac ctt cct tac ctg gta gcg tac caa 3039
Gln Thr Lys Gln Ser Gly Glu Asn Leu Pro Tyr Leu Val Ala Tyr Gln
335 340 345 350

gcc acc gtg tgc gct agg gct caa gcc cct ccc cca tcg tgg gac cag 3087
Ala Thr Val Cys Ala Arg Ala Gln Ala Pro Pro Pro Ser Trp Asp Gln
355 360 365

atg tgg aag tgt ttg att cgc ctc aag ccc acc ctc cat ggg cca aca 3135
Met Trp Lys Cys Leu Ile Arg Leu Lys Pro Thr Leu His Gly Pro Thr
370 375 380

ccc ctg cta tac aga ctg ggc gct gtt cag aat gaa atc acc ctg acg 3183
Pro Leu Leu Tyr Arg Leu Gly Ala Val Gln Asn Glu Ile Thr Leu Thr
385 390 395

cac cca gtc acc aaa tac atc atg aca tgc atg tcg gcc gac ctg gag 3231
His Pro Val Thr Lys Tyr Ile Met Thr Cys Met Ser Ala Asp Leu Glu
400 405 410

gtc gtc acg agc acc tgg gtg ctc gtt ggc ggc gtc ctg gct gct ttg 3279
Val Val Thr Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala Leu
415 420 425 430

gcc gcg tat tgc ctg tca aca ggc tgc gtg gtc ata gtg ggc agg gtc 3327
Ala Ala Tyr Cys Leu Ser Thr Gly Cys Val Val Ile Val Gly Arg Val
435 440 445

gtc ttg tcc ggg aag ccg gca atc ata cct gac agg gaa gtc ctc tac 3375
Val Leu Ser Gly Lys Pro Ala Ile Ile Pro Asp Arg Glu Val Leu Tyr
450 455 460

cga gag ttc gat gag atg gaa gag tgc tct cag cac tta ccg tac atc 3423
Arg Glu Phe Asp Glu Met Glu Glu Cys Ser Gln His Leu Pro Tyr Ile
465 470 475

gag caa ggg atg atg ctc gcc gag cag ttc aag cag aag gcc ctc ggc 3471
Glu Gln Gly Met Met Leu Ala Glu Gln Phe Lys Gln Lys Ala Leu Gly
480 485 490

ctc ctg cag acc gcg tcc cgt cag gca gag gtt atc gcc cct gct gtc 3519
Leu Leu Gln Thr Ala Ser Arg Gln Ala Glu Val Ile Ala Pro Ala Val
495 500 505 510

cag acc aac tgg caa aaa ctc gag acc ttc tgg gcg aag cat atg tgg 3567
Gln Thr Asn Trp Gln Lys Leu Glu Thr Phe Trp Ala Lys His Met Trp
515 520 525

aac ttc atc agt [illegible] ata caa tac ttg gcg ggc ttg tca acg ctg cct 3615
Asn Phe Ile Ser Gly Ile Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro
530 535 540

ggt aac ccc gcc att gct tca ttg atg gct ttt aca gct gct gtc acc 3663
Gly Asn Pro Ala Ile Ala Ser Leu Met Ala Phe Thr Ala Ala Val Thr
545 550 555

agc cca cta acc act agc caa acc ctc ctc ttc aac ata ttg ggg ggg 3711
Ser Pro Leu Thr Thr Ser Gln Thr Leu Leu Phe Asn Ile Leu Gly Gly
560 565 570

tgg gtg gct gcc cag ctc gcc gcc ccc ggt gcc gct act gcc ttt gtg 3759
Trp Val Ala Ala Gln Leu Ala Ala Pro Gly Ala Ala Thr Ala Phe Val
575 580 585 590

ggc gct ggc tta gct [illegible] gcc gcc atc ggc agt gtt gga ctg ggg aag 3807
Gly Ala Gly Leu Ala Gly Ala Ala Ile Gly Ser Val Gly Leu Gly Lys
595 600 605

gtc ctc ata gac atc ctt gca ggg tat ggc gcg ggc gtg gcg gga gct 3855
Val Leu Ile Asp Ile Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala
610 615 620

ctt gtg gca ttc aag atc atg agc ggt gag gtc ccc tcc acg gag gac 3903
Leu Val Ala Phe Lys Ile Met Ser Gly Glu Val Pro Ser Thr Glu Asp
625 630 635

ctg gtc aat cta ctg ccc gcc atc ctc tcg ccc gga gcc ctc gta gtc 3951
Leu Val Asn Leu Leu Pro Ala Ile Leu Ser Pro Gly Ala Leu Val Val
640 645 650

ggc gtg gtc tgt gca gca ata ctg cgc cgg cac gtt ggc ccg ggc gag 3999
Gly Val Val Cys Ala Ala Ile Leu Arg Arg His Val Gly Pro Gly Glu
655 660 665 670

ggg gca gtg cag tgg atg aac cgg ctg ata gcc ttc gcc tcc cgg ggg 4047
Gly Ala Val Gln Trp Met Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly
675 680 685

aac cat gtt tcc ccc acg cac tac gtg ccg gag agc gat gca gct gcc 4095
Asn His Val Ser Pro Thr His Tyr Val Pro Glu Ser Asp Ala Ala Ala
690 695 700

cgc gtc act gcc ata ctc agc agc ctc act gta acc cag ctc ctg agg 4143
Arg Val Thr Ala Ile Leu Ser Ser Leu Thr Val Thr Gln Leu Leu Arg
705 710 715

cga ctg cac cag tgg ata agc tcg gag tgt acc act cca tgc tcc ggt 4191
Arg Leu His Gln Trp Ile Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly
720 725 730

tcc tgg cta agg gac atc tgg gac tgg ata tgc gag gtg ttg agc gac 4239
Ser Trp Leu Arg Asp Ile Trp Asp Trp Ile Cys Glu Val Leu Ser Asp
735 740 745 750

[illegible] aag acc tgg cta aaa gct aag ctc atg cca cag ctg cct ggg atc 4287
Phe Lys Thr Trp Leu Lys Ala Lys Leu Met Pro Gln Leu Pro Gly Ile
755 760 765

ccc ttt gtg tcc tgc cag cgc ggg tat aag ggg gtc tgg cga ggg gac 4335
Pro Phe Val Ser Cys Gln Arg Gly Tyr Lys Gly Val Trp Arg Gly Asp